DDDAS/ITR: A Data Mining and Exploration Middleware for Grid and Distributed Computing
نویسندگان
چکیده
We describe our project that marries data mining together with Grid computing. Specifically, we focus on one data mining application the Minnesota Intrusion Detection System (MINDS), which uses a suite of data mining based algorithms to address different aspects of cyber security including malicious activities such as denial-of-service (DoS) traffic, worms, policy violations and inside abuse. MINDS has shown great operational success in detecting network intrusions in several real deployments. In sophisticated distributed cyber attacks using a multitude of wide-area nodes, combining the results of several MINDS instances can enable additional early-alert cyber security. We also describe a Grid service system that can deploy and manage multiple MINDS instances across a wide-area network.
منابع مشابه
Towards Dynamic Data-Driven Management of the Ruby Gulch Waste Repository
Previous work in the Instrumented Oil-Field DDDAS project has enabled a new generation of data-driven, interactive and dynamically adaptive strategies for subsurface characterization and oil reservoir management. This work has led to the implementation of advanced multiphysics, multi-scale, and multi-block numerical models and an autonomic software stack for DDDAS applications. The stack implem...
متن کاملA new Approach for Selecting Best Resources Nodes by Using Fuzzy Decision Tree in Grid Resource Broker
Nowadays, Grid Computing has been accepted as an infrastructure to perform parallel computing in distributed computational resources. Grid has users, resources, and an information service (IS). Resource broker service is one of the main services in grid to find resources, filter resources, allocate resources, etc. Resource selection is part of resource broker that is an important issue in a gri...
متن کاملSPHINX: a scheduling middleware for data intensive applications on a grid
A grid consists of high-end computational, storage, and network resources that, while known a priori, are dynamic with respect to activity and availability. Efficient coscheduling of requests to use grid resources must adapt to this dynamic environment while meeting administrative policies. We discusses the necessary requirements of such a scheduler and introduce a distributed framework called ...
متن کاملDIGIDT: Distributed Classifier Construction in the Grid Data Mining Framework GridMiner-Core
Grid Data Mining denotes efforts to utilize data mining and knowledge discovery techniques leveraging the largescale computational and storage power offered by Grid infrastructures. Data preprocessing and data mining algorithms are known to be both compute and data intensive and therefore appear to be ideal pilot applications to test whether Grid toolkits hold what they promise. This paper desc...
متن کاملDistributed Mining of Molecular Fragments
In real world applications sequential algorithms of data mining and data exploration are often unsuitable for datasets with enormous size, high-dimensionality and complex data structure. Grid computing promises unprecedented opportunities for unlimited computing and storage resources. In this context there is the necessity to develop high performance distributed data mining algorithms. However,...
متن کامل